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Abstract 

In this paper, the proportionally fair allocation of time slots in a frame, as well as power level to 
multiple receivers in an energy harvesting broadcast system, is considered. Energy harvest times in a 
frame are assumed to be known at the beginning of that frame. The goal is to solve an optimization 
problem designed to maximize a throughput-based utility function that provides proportional fairness 
among users. An optimal solution of the problem was obtained by using a Block Coordinate Descent 
(BCD) method in earlier work (presented in Part I of this study). However, finding the optimal allocation 
entails a computational complexity that increases sharply in terms of the number of users or slots. In 
this paper, certain structural characteristics of the optimal power-time allocation policy are derived. 
Building on those, two simple and computationally scalable heuristics, PTF and ProNTO are proposed. 
Simulation results suggest that PTF and ProNTO can closely track the performance of the BCD solution. 
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I. Introduction 



With increasing awareness of the potential harmful effects to the environment caused by 
"greenhouse gas" emissions and the depletion of non-renewable energy sources, there is a 
growing consensus on the need to develop more energy-efficient communication systems [[T||. 
Whether the objective is to decrease the carbon footprint of wireless communications or to 
make nodes of a wireless network energy-wise self-sufficient, harvesting ambient energy is 
a promising approach for wireless communications. Ambient energy sources include sunlight, 
heat differentials, mechanical vibration, RF radiation, or any other physical source that can 
produce an electrical charge through a transducer (photovoltaic cell , piezoelectric element, 
etc.). Communication devices that can be powered by rechargeable batteries which store energy 
harvested through such means are already commercially available. However, harvested power 
is typically irregular and can at times fall short providing typical power consumption levels in 
wireless nodes. If it is desired for energy harvesting systems to match the performances their 
regular battery or grid-powered counterparts, the need to accomplish this by efficiently utilizing 
an unsteady power source opens up new challenges for the design of transmission as well as 
resource allocation schemes. 

There has been a considerable amount of recent research effort on optimizing data transmission 
with an energy harvesting transmitter. A single-user communication system operating with an 
energy harvesting transmitter is considered in [2], where a packet scheduling scheme that 
minimizes the time by which all of the packets are delivered to the receiver is obtained. A 
multi-user extension of [0 has also been considered in [|3l, BH and the same time minimization 
problem is solved for a two user broadcast channel. These approaches are exended in [|5l and [|6l 
to the case of a transmitter with a finite capacity battery. [)7l extends [|2l to propose the directional 
water-filling algorithm that finds the optimal energy management schemes for energy harvesting 
systems operating in fading channels, with finite capacity rechargeable batteries. In [8], the 
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authors consider the problem of energy allocation over a finite horizon for point-to-point wireless 
communications and use dynamic programming and convex optimization techniques to obtain 
the throughput-optimal energy allocation. 

The first part of our study on this problem [0 differs from the above-mentioned studies 
(and others cited in [9J) particularly in its aim to maximize the throughput in a proportionally 
fair way, taking into account the inherent differences of channel quality among users. In [j9J, 
we considered allocating among users the transmission power and the proportion of the time 
between energy harvests, to achieve a good balance between throughput and fairness on an energy 
harvesting downlink. Specifically, a proportional fairness based utility maximization problem in 
a time-sharing multi-user additive white Gaussian noise (AWGN) broadcast channel, where the 
transmitter is capable of energy harvesting is considered. The aim is to achieve the optimum 
off-line schedule, by assuming that the energy arrival profile at the transmitter is deterministic 
and known ahead of time in an off-line manner for a time window, called frame, i.e., the energy 
harvesting times and the corresponding harvested energy amounts are known at the beginning 
of each frame. The treatment in [|9l considers the general case in which the interarrival times 
between consequtive harvests do not have to be equal. Here, we focus on the case where energy 
interarrival times are equal. Not all generality is lost, because harvest amounts are arbitrary and 
the absence of a harvest in a certain slot can be expressed with a harvest of amount zero for the 
respective slot. Periodic sampling of harvests is also consistent with practice as in many energy 
harvesting systems, transmitters have supercapacitors that can store the harvested energy and 
supply in every predetermined time window, allowing the case of periodic energy arrivals. 

In this paper, we show that by using the periodic energy arrivals assumption, it is possible 
to analytically derive the characteristics of the optimal solution of the Problem proposed in [|9l. 
In [m, we proved that the problem in hand is a biconvex problem and has multiple local optima. 
This allowed us to decompose the problem into two parts (power allocation, time allocation) 
and present a Block Coordinate Descent based optimization algorithm, BCD [|9l, that converges 
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to a partial optimal solution. We believe that the partial optima found by BCD algorithm is 
very close to the local optima of the problem and thus, it achieves optimal or close to optimal 
utility im. Although BCD is guaranteed to converge to a partial optimal solution and thus the 
partial optimal utility, it is computationally expensive and when there are tens of users and energy 
arrivals, forming invertible hessian matrices (needed for the optimization of the power variables) 
may be computationally excessive. Hence in this paper, we first derive the characteristics of the 
optimal solution and then, build on those to develop simple heuristics, PTF and ProNTO that 
closely track the performance of the BCD solution. 

We start by describing the system model in the next section. Next, we make the problem 
statement precise in Section [nil Section |IV] discusses the structure and properties of the optimal 
solution. Depending on these properties, PTF and ProNTO heuristics are proposed in Section |V] 
In Section IVIIl we present our numerical and simulation results. We conclude in Section IVIIII 
with an outline of future directions. 

II. System Model 

Consider a time-slotted system where each frame, of length Fi, is divided into K slots. There 
is a single transmitter that transmits to users by time sharing. Channel conditions remain 
constant during {gn, the gain of user n, is chosen to be constant throughout the frame). The 
transmitter is equipped with a rechargeable battery such that some energy, Eu, is harvested from 
the environment at the beginning of each time slot t of frame i. The length of the t*'^ slot of 
frame i may be represented as Tu. However, as we are interested in a specific frame, we drop 
the frame indicator i and define the harvested energy in slot t as Ft, and, the length of slot t as 
Tt. Note that, we use the same system model as in [9]. However, unlike flUl, in this paper we 
assume periodic energy arrivals and hence equal slot lengths (T^ = T for all t = 1, . . . , K), as 
shown in Figure [Tl to reveal the characteristics of the optimal solution of Problem \T\ 

Similar to the setting in [|9l, for a given frame, the transmitter chooses a power level pt and 
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a time allocation vector rj = (tu, ...,r7vt), for each time slot t of the frame, where pnt = Pt 
is the selected transmission power for user n during slot t and, Tnt is the time allocated for 
transmission to user n during slot t. 

III. Problem Statement 

The total achievable number of bits sent to user n within one frame (proportional to the 
throughput obtained by that user in the frame), is J2f=i'''ntW \og2 ^1 + [HI- We aim to 

maximize a utility function, the log-sum throughput over the users J2n=i^'^92{Rn), which is 
known to achieve proportional fairness [[TOl , in the presence of energy harvesting. We start with 
the problem of interest. Problem [T] defined in [|9l. This is a constrained optimization problem 
that aims to maximize the utility function with respect to the time and energy constraints. 



Problem 1: 



N / K , 

Maximize: C/(r, v) = Y. l^Ss E l^Ss ( 1 + 

n—\ \t—\ ^ ^ 

subject to: T„t > , pt > (1) 



n=l 
K 

5ZT„t>e (3) 
t=i 
t t 



i=l 1=1 



where t = 1, ...,K and n = 1, ...,N. W is the bandwidth for a single link channel, and No 
is the power spectral density of the background noise. Hence, represents the SNR of user 
n in slot t. Equations in ([T]) represent the nonnegativity constraints. The time-limit constraints, 
the set of equations in ensure that the total time allocated to users does not exceed the 
slot length. The set of equations in ([3]), on the other hand, are technical constraints and ensure 
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that every user gets some time (> e where e is an infinitely small number) during the frame. 
The set of equations in dH) are called energy causality constraints as these ensure no energy is 
transmitted before becoming available. 

One might hope that this problem has a unique solution and no local optima except for one 
global optimum. Unfortunately, © is a nonlinear non-convex problem with potentially multiple 
local optima. Indeed, in [9J, analysis of structural characteristics of the problem revealed that 
it can be formulated as a biconvex optimization problem, and that it has multiple optima. In 
the next section, we decompose Problem [T] into two parts (power allocation, time allocation) to 
investigate and derive the characteristics of these optima. 

IV. Structure and Properties of the Optimal Solution 

In this section, we analyze the structure and properties of the hybrid power-time allocation 
policy. Remember that the utility function of Problem [T] is 

JV K 

U = J2^0g2{Y,'^ntRnt) (5) 
n=l t=l 

where Rnt represents the rate of link n in t*'^ slot: 

Rnt = Wlog2 (1 + LnPt) where L„ = (6) 
Let Rn = [Rni Rn2 ■ ■ ■ RukY ^^d 7^ = ■■■ t^kY ■ Then, utility can be rewritten as 

N 

U = Y,^og2{rfR;,) (7) 

n=l 

= Ui + U2 + ... + Um (8) 

where f/„, the utility of user n, is 
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Un = log2{TjRn) (9) 

In order to reveal characteristics related to the optimal solution that will help us develop 
computationally efficient and close-to-optimal heuristics, we decompose the problem into two 
parts (similarly as in [9J): power allocation and time allocation. 

A. Structure of an Optimal Power Allocation Policy 

In this section, we analyze the structure and properties of the optimal power allocation policy. 
In order to do this, we assume that the time allocation is determined, and try to characterize the 
structure of the optimal solution of the power allocation problem for this time allocation. Clearly, 
when the only variables are power variables. Problem [1] reduces to the following constrained 
optimization problem: 



Problem 2: 


N 

Maximize: U((p) = ^ Un{p) 

n=l 






subject to: pt > 


(10) 




t t 

i=l i=l 


(11) 



where t = 1, K and, f/„ is a function of thepower variables (as defined in Eq. (|9])). In our 
previous work [|9l, we proved the strict convexityu of Problem [21 Similarly, the general problem. 
Problem [H is shown to be a biconvex optimization problem that has many local minima [[91 ■ 
As Problem [21 has a unique optimum, the optimal power allocation changes for every given 

'Maximizing U{p) is equivalent to minimizing -~U{p) wiiicii is a convex objective function. 
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time allocation. In Theorem [H we claim that one of the optimum schedules of Problem \T\ has a 
nondecreasing power schedule. Lemma [1] not only helps us to prove our claim but also reveals 
that Problem [T] has multiple optima. From the proof Lemma [T] the attentive reader can observe 
that any feasible permutatioiu of the optimal schedule (r*,p*), described in Theorem [T] is also 
optimal. 

Theorem 1: When all slots have equal length (Tj = T, for \fj E {1, K}), there exists an 
optimal schedule such thatp* is nondecreasing, (e.g., p* = (pi, ...jPk) where Pi < P2 ^ 

... < pk). 

Proof: The proof is provided in Appendix lAl and rests on Lemma [1] below. ■ 
We shall need the following definition of a permutation of a vector sorted in nondecreasing order 
of elements, for stating Lemma \T\ 

Definition 1: Given a vector = [Rni Rn2 ■■■ RnxV, we define Rn = [Rnn(i) Rn-K(2) ■ ■ ■ Rn-K(K)\' 
where rJ is a permutation (sorted in increasing order) of _R„, such that 



Rn-K{l) < ■ ■ ■ < Rmv{2) < • • • < RmriK) (12) 

Lemma 1: When all slots have equal length (Tj = T, for \fj E {l,...,K}), for any given 
schedule {t,Vc), we can find such r;^,-R^ (where R'^ = Rn ) that [t'^Y R'^ = Tn'^Rn for all 
n = 1, . . . , N; i.e., the utility, U, does not change. Hence, if (t^*, Rn*) is optimal, then (r^*, R'n*) 
is also optimal. 

Proof: The proof is provided in Appendix El ■ 

B. Structure of an Optimal Time Allocation Policy 

In this section, we assume that the power allocation through the slots is determined. Then, 
given that the power variables are known constants, we determine the structure and properties 

feasible permutation is any permutation of a given schedule that does not violate the constraints described in Eqns. (I)-©. 
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of the optimal time allocation policy. Since, the only variables are time variables, Problem [T] 
reduces to Problem |3l 



Problem 3: 


N 

Maximize: f7(r) — ^ J7„(r) 

n=l 






subject to: t„( > 


(13) 




N 
n=l 


(14) 




K 

t=i 


(15) 



where t = 1, ...,K, n = 1, N and, Un is a function of the time variables (as defined in Eq. 
(HI)). In Problem [3] is shown to be convex. Thus, the analysis can rely on KKT (Karush- 
Kuhn-Tucker) optimality conditions, which must be satisfied by the global optimum. We start 
by forming the Lagrangian function as follows: 

K N NK+N K K N 

j=l i=l j=NK+l t=l i=l n=l 

(16) 

where yu's are the Lagrange multipliers, and, the total number of constraint]^ is N{K + 1) + K. 
After defining the Lagrangian as in Eq. (fT6] ). one can construct the KKT conditions for the optimal 
solution. Due to space limitations, we do not list the conditions here but refer the interested 
reader to the associated technical report [TT] for the details. Please note that the optimal time 
allocation should jointly satisfy the set of equations that arise from KKT conditions. Clearly, 
as the number of users, A^, and, the number of slots, K, increase, the number of equations 

^There are K equality constraints and NK + N inequality constraints. 
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increases dramatically making it cumbersome to write analytical solutions. Therefore, for the 
sake of conciseness, we continue the analysis with the special case of two users and two slots 
which allows us to construct the characteristics of the optimal time allocation policy. 

Consider two consequtive slots with different power levels. Let us call the one with the least 
power the weak slot, and the one with the highest power the strong slot. When the slots have 
equal length (Ti = T2 = T), the optimal policy has the properties described in Lemma |2] 

Lemma 2: In an optimal schedule, time allocation over the two slots (of equal length) has the 
following properties: 

1) The weak slot is assigned to only one of the users. The strong slot, however, is shared 
between users. When both power levels are equal; if one slot is assigned to user 1 (user 
2), the other slot is assigned to user 2 (user 1). 

2) To whom the the weak slot will be assigned depends on two criteria: first, r„ = which 
is the ratio of user n's rate in the second slot to that in the first, and second, whether the 
strong slot is before or after the weak slot. When the weak slot preceeds the strong slot, 
it is assigned to the user with the smaller V. Otherwise (implying the decrease in power 
level), it is assigned to the user with the higher V. 

3) In a strong slot, the user that did not (or will not) receive any data in the weak slot is 
favored, i.e., more than half of the slot is assigned to that user. In order to preserve fairness, 
this favoring operation is done by considering Fi and 

Proof: The proof is provided in Appendix O ■ 

V. PTF Heuristic 

In this section, we develop a heuristic algorithm. Power- Time-Fair (PTF), based on the char- 
acteristics (discovered in the previous section) of an optimal power/time allocation schedule. The 
PTF algorithm operates as follows: 

1) For Power Allocation: Assign nondecreasing powers through the slots by using the energy 
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harvest statistics, as follows: 

a) From a slot, say i, to the next one i + 1: If harvested energy decreases, defer a A 
amount of energy from slot i to slot z + 1 to equalize the power levels. Do this until 
all powers are nondecreasing, and, form a virtual nondecreasing harvest order. 

b) By using the virtual harvest order, assign nondecreasing powers through the slots, 
i.e., in each slot, spend what you virtually harvested at the beginning of that slot. 

2) For Time Allocation: For the power allocation found in 1), let, Bnt = RntT be the number 
of bits that would be sent by user n if the whole slot (of length T) was allocated to that 
user. Assign the first slot to the user who has the maximum rate, Rnt, in that slot. For the 
other slots, apply the following: At the beginning of each slot, t E {2, . . . , K}, determine 
the user with the maximum (3 where, 

and, assign the whole slot to that user. If multiple users share the same (3, then, allocate 
the slot to the user with the best channel. 

Simulation results show that the performance of the PTF algorithm is close to the performance 
of the BCD algorithm. 

VI. Pronto heuristic 

In this section, we develop a fast and simple heuristic, ProNTO (Powers Nondecreasing - Time 
Ordered), based on the optimal power allocation related characteristics discovered in Section ITV-AI 
and the simulation results obtained by running BCD algorithm for periodic energy arrivals. The 
ProNTO algorithm operates as follows: 

1) For Power Allocation: Assign nondecreasing powers through the slots by using the energy 
harvest statistics, as done in part (1) of PTF algorithm. 
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2) For Time Allocation: Order the users, ui, . . . ,un, according to their channel quality 
and form a user priority vector, = [u\, . . . , uj^] where u\ represents the user with 
the best channel. As K > N, Allocate every user -^~™^(-^'^) ^^ots as follows: The first 
K-mod{K,N) ^Yoi'g, are allocated to u\, the next ^-"^"^(^'^^ g^ots are allocated to U2, etc. 
Add the remaining mod{K, N) slots to the most powerful mod(K, N) users' slots. For 
example; Let K = 12 and = 5, and the path losses of the users to be 13 dB, 17 dB, 
10 dB, 12 dB, 20 dB respectively. Then, the first 3 slots are allocated to user 3, the next 
3 slots are allocated to user 4, the following 2 slots are allocated to user 1, 9*'* and 10*'^ 
slots are allocated to user 2, and the last 2 slots are allocated to user 5. 

Thus PTF and ProNTO differ only in time allocation part. The time allocation method used 
in ProNTO is proposed according to the following observation: when a partial optimal solution 
obtained by BCD algorithm is modified as described in Lemma \T\ and its proof, to form the 
nondecreasing optimal schedule, the time allocation becomes ordered, e.g., as shown in Table 
Uni As time allocation method used in ProNTO is simpler than the one used in PTF, ProNTO 
can operate faster. Simulation results show that the performance of ProNTO is close to the 
performance of the BCD algorithm. 

VIL Numerical and Simulation Results 

In this section, we present the numerical and simulation results related to PTF and ProNTO 
heuristics. Throughout our simulations, we use the folowing setup: W = IkHz, = 10''^W/ Hz. 
We assume that some amount of energy (e < < 00 where e is an infinitely small value) 
is harvested every 10 seconds (T = 10), within a frame (period of known harvests). Note 
that, throughout this section, the units used for frame length, energy, and power are; seconds. 
Joules, and Watts respectively. Throughout our simulations, we use four different frame lengths; 
20, 80, 100, 120. For the frame of 20 sees, we use three different energy harvest models; 
[0.5,50], [50,0.5], [6020]. We define different cases for the remaining three frame lengths; 

DRAFT 



13 

Regular, Bursty, and, Very Bursty. In Regular, the harvest amounts are close to each other 
and form a regular pattern; Er = [73,65,9,19,40,37,22,84,39,67,81,100]. In Bursty, there 
are short term sudden decreases and increases in harvest amounts, causing a bursty pattern; 
Eb = [20, 100, 1, 1, 1, 70, 100, 1, 10, 40]. Finally, Very Bursty represents an extreme case where 
the transmitter stays energy-hungry for a long time; Ey = [90, 2, 0.5, 0.1, 0.3, 0.7, 40, 60]. 

We start by the simplest case of two users and two slots (A^ = 2, K = 2, frame of 20 sees) 
to compare the results obtained by BCD algorithm [|9l, with the optimal ones presented in Table 
m Our objective in doing such a comparison is to prove the accuracy of both theoretical and 
simulation results. We refer the interested reader to Appendix O for the details of the optimality 
table, and provide the comparison in Table III Note that as in |l9l, the starting point of the 
algorithm is the Spend What You Get (SG) policy (proposed by Gorlatova et. al. 01211 ) combined 
with TDMA time allocation (SG+TDMA). The first column of Table HI] shows the amount of the 
harvests (Ei, E2). The second column represents the mean path loss (in dB) of the two users. 
As observed from the table, for a given power allocation, the results found by BCD algorithm 
and the optimal ones (obtained by KKT optimality conditions) are almost the same, verifying 
the consistency and optimality of the algorithm. 

The attentive reader can observe from Table H] that, when harvests decrease from one slot 
to another, the optimal powers tend to be nondecreasing. Hence in that case, the algorithm 
seems to be converged to the nondecreasing optimal discussed in Theorem [U Note that, this 
nondecreasing optimal could also be obtained by using the modification method explained in 
Lemma [U By using that method, we modify the results obtained by BCD algorithm to reveal the 
optimal (nondecreasing) power and time allocation policies for increasing number of users. For 
our analysis, we use three different path loss patterns, called. Low, Moderate, High respectively. 
In Low, the strongest user in the system has 13 dB path loss, and, every new user that joins the 
system deviates by 3 dB from the previous one (has 3 dB more path loss than the preceding 
user). In Moderate, the strongest user has 19 dB path loss, and, every new user deviates by 3 
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dB. Finally, in High, the strongest user has 25 dB path loss, and, every new user deviates 3 dB. 
Due to space limitations, we present only the Bursty-Moderate case's results in Table Hill As 
illustrated, when the number of users increase, BCD algorithm tends to assign increasing powers 
rather than nondecreasing. One can also see from the table that, no matter how many users exist 
in the system, ordering powers in nondecreasing order, causes the time allocation to be ordered 
too. By ordered, we mean that the first slot(s) are allocated to the user with the best channel, the 
next slot(s) are allocated to the user with the second best channel, etc. , and the last slot(s) are 
allocated to the user with the worst channel. This observation constitutes the main motivation 
for the ProNTO heuristic. 

We next use the above-mentioned energy harvesting cases (Regular, Bursty, Very Bursty) to 
compare the PTF and ProNTO heuristics' performances to that of BCD's. We start by testing 
the utility and throughput improvement (over SG+TDMA) performances of the heuristics for 
increasing path losses. For this, we set the number of users to two, i.e., N = 2. The results 
are presented in Figure |2] and Figure |3] respectively. In both figures, the Mean Path Loss, is 
computed as L = J2iLi where Li represents the path loss of user i. Hence, the three mean 
path losses seen in the figures represent the Low, Moderate and High cases. One can observe 
from Figure |2] that, the utility improvements of all algorithms tend to increase (or at least stay 
constant) when path loss increases, and the utility improvement performances of the proposed 
heuristics are very close to that of BCD's. For the chosen cases, ProNTO outperforms PTF. This 
is more obvious for the Very Bursty case. The corresponding throughput improvements are shown 
in Figure |3l As illustrated, for the case of = 2, even with ^ 5% of utility improvement, a 
~ 65% of improvement in total throughput is possible. Note that, in all cases, the performances 
are very close to each other. 

In order to determine the effect of number of users to the performances of our proposed heuris- 
tics, we next perform a series of simulations by considering all energy harvesting cases (Regular, 
Bursty, Very Bursty) and different number of users. By taking average over all energy harvesting 
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cases, we present the average utility improvement results in Figure |4l for the Moderate case. As 
illustrated in the figure, when the number of users increase, the average utility improvements 
of all schemes also increase. Note that, both heuristics closely track the BCD algorithm. When 
there are few users in the system, PTF and ProNTO are competitive. However, when there are 
more users, ProNTO seems to outperform PTF in terms of average utility improvement. At all 
instances, ProNTO is within the 1% neighbourhood of the BCD algorithm. 

Although we aim at proportional fairness in this work, it may be interesting to analyse max- 
min fairnesses of the proposed algorithms, PTF and ProNTO. Jain's index is a well-known 
measure of fairness [[T3l . [[T4l . The index FI takes the value of 1 when there is a complete fair 
allocation. 



FI = , (17) 



For computing FI, we use the no. of bits transmitted to the users, Xi = 2^^ for i = 1, . . . , A^, 
where Ui is as defined in Eq. (|9l). From Table |IVl it is clear that SG+TDMA is the worst choice 
in terms of fairness. Although low path losses embrace lower utility improvement, they mainly 
allow both PTF and ProNTO to be very efficient in terms of fairness. However, as observed 
from the table, when all three cases are considered, PTF seems to be more fair than ProNTO is. 
Hence, ProNTO seems to trade of fairness for utility improvement. It can also be inferred from 
Figure |4] and Table |IV] that, when ProNTO outperforms PTF in terms of utility improvement, the 
difference between two heuristics is not high. However, this is not the case for fairness, .i.e., when 
PTF outperforms ProNTO, the difference can be considered as high. Hence, although ProNTO 
seems more promising in terms of utility improvement, depending on system requirements, one 
can still choose PTF over ProNTO for more fairness. 
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VIII. Conclusion 

This paper presented the second part of a study whose first part was reported in [|9l. Building on 
the problem formulation and the optimal solution method in flU, the optimal resource allocation 
policy was further studied and certain structural characteristics of the optimal solution were 
established. In particular, the existence of an optimal nondecreasing power schedule and, an 
ordered time allocation schedule were proved. This allowed us to propose two alternative efficient 
and scalable heuristics, PTF and ProNTO. The computational ease of these algorithms were 
observed in numerical examples, while the policies they result in coincide with the structural 
properties we have shown the optimal to have. Simulation results indicate that, despite their 
simplistic design, PTF and ProNTO heuristics can closely track the performance of the optimal 
BCD algorithm. In our examples, which were computed for small or moderate problem sizes, 
both PTF and ProNTO took one or two orders of magnitude smaller time to converge than 
BCD, which has to compute a Hessian. Typically, ProNTO outperforms PTF in terms of utility 
improvement, whereas the latter is fairer. The utility improvement difference between BCD and 
ProNTO is shown to be less than 1% at all instances. 

An interesting future direction could be the development of an online algorithm that will 
bypass the need for offline knowledge about the energy harvesting statistics. This algorithm may 
use energy harvesting prediction algorithms to predict the energy that will arrive in the future, 
or estimate it on the fly during network operation. 

Appendix A 
Proof of Theorem [H 

The proof is done by contradiction. For any given time allocation r, consider a given power 
sequence, Vc = {pi, ..■,Pd-i,Pd, ■■■■,Pk), in which the power level decreases at some time, say 
d > 1. In such a case, we can defer some energy, < A < pd-iTd-i , from the {d — Vf^ slot to 
the d^^ slot forming a modified schedule, = {pi, ...,p'^_^,p'^, ...,pk), that will not violate the 
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energy causality conditions (as shown in Fig. |5]). Clearly, we can continue this deferral operation 
until p'd_i < v'd ^'^^ ^^"^ '^o'^ violate the energy causality conditions. Applying the same method 
for every possible decrease leads us to a nondecreasing schedule, V\ = {p'l, ...,p'^_^,p'^, 
where pi < P2 < ••• < p'k- 

From Lemma [B U{t,Vc) = Uij'^c ^Vp). Thus, for time allocation Vq is optimal. 

This completes the proof. 

Appendix B 
Proof of Lemma [H 

Let, R'^ = Rn where rJ is as defined in Definition [IJ Note that Eq. (fT2l) forces 

log^il + Lnp\) <...< log2{l + L^p'i) <...< log^il + Kp'j,) (18) 
1 + Lnp\ <...<! + L^p'i <...<! + Kp'k (19) 
p'^<...<p'i<...<p'k (20) 

Hence, sorting i?„ in increasing order, forces nondecreasing powers (ordered schedule 
mentioned previously), which indeed forces all other Ri (where iG{l,...,i — + A^}) 
to be sorted in increasing order, to form R[. Now, we have new rates, R[ for all users i = 
1, . . . , N. Remember that the utility of a user is defined as in Eq. Thus, changing the order 
of Ri vector does not change the value of Ui if the order of 77 is also changed so that the 
previous element pairs are matched again. Let us explain this, with an example. Let Ri2 < Rn, 
Rnk < Ri2, and, Rn < Ris < ... < Ri(K-i)- Then, r/, and, i?- vectors are defined as 
R'i = [RiK Ri2 Rn Riz ■■■ Ri{K-i)Y and r/ = [TiK r^s ... Ti(^K-i)V- Hence, it is 

straight forward to write that 
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r/ R[ = TixRiK + Ti2Ri2 + TiiRii + . . . + Ti(K-l)Ri(K-l) 
= Til-Ril + Ti2-Ri2 + . . . + Ti(^K~l)Ri(K~l) + TixRiK 

= n^Ri (21) 

where 77 and Ri are as defined in Eq. dV]). As it can be observed, Ui = U[ as long as R[ = Ri 
and t[ = iji)^ . Here, indicates the 77 vector ordered according to Ri . Under these 

circumstances, Ui = U- for all z = 1, . . . , A^, and, the overall utility does not change, U = U'. 
This completes the proof. 

Appendix C 
Proof of Lemma [2] 

For the proof of Lemma l2l we use the KKT optimality conditions. Let, An = r*^i?„i + r*2-R„2- 
Then, for the special case, {N = 2 , K = 2), the set of KKT conditions described in IfTTll reduces 
to Eqns. (l22al)-([22gl). 



/i* > (22b) 

<t > (22c) 

<i + <2>e (22d) 

r*, + = T (22e) 

^2(t-l)+n<t = (22f) 

/^:+n«i + <2 - e) = (22g) 
for i = 1, . . . ,6, n = 1,2 and t = 1, 2. Combining the set of equations described above leads 
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US to the following optimality conditions for the time allocation: 



/^2t-i^u = (23a) 

Solving the set of equations in Eq. (l23l) . one can obtain the desired relation between power 
allocation and time allocation, as illustrated in Table H Due to the convex nature of the problem, 
the solutions presented in Table |I] represent the global optima, when the rate improvements of 
the users, r„, are equal. By inspecting Table IB one can observe the properties mentioned in 
Lemma [21 AH cases are summarized in Table HI which completes the proof. 
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Fig. 1. Problem illustration: There are K energy arivals in a frame, and, the time between consecutive arrivals are allocated 
to N users. 
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TABLE I 

Overall Optimality Conditions for the Special Case of Two Users and Two Slots (Ti = T2): Categorized 

ACCORDING TO THE RELATION BETWEEN THE POWERS ALLOCATED IN THE FIRST AND SECOND SLOTS. FOR A GIVEN 
POWER ALLOCATION, THE OPTIMAL TIME ALLOCATION DIFFERS ACCORDING TO THE RELATION BETWEEN THE RATE 

IMPROVEMENTS OF THE USERS. 
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Users' Rate Improvement 
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Slot 2 
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TABLE 11 

BCD VS. Optimal Results for the Special Case of Two Users and Two Slots 



Harvests 


Mean Path loss 


Power Allocation by BCD 


Time Allocation by BCD 


Optimal Time Allocation 


Utility by BCD 


Optimal Utility 


[0.5 50] 


20.5 


[0.0500 5.0000] 


[10 4.4129] 
I S.S87lJ 


[10 4.41291 
I 5.587lJ 


29.8094 


29.8094 


26.5 


[0.0500 5.0000] 


iU 4. / 399 
1- 5.260lJ 


lU 4. / 399 

1- 5.2601J 


28.4062 


28.4062 


32.5 


[0.0500 5.0000] 
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8.5 
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Fig. 2. Utility Improvement (BCD, PTF, ProNTO) vs. Mean Path Loss for N — 2: The effect of mean path loss on utility 
improvement for the three energy harvesting cases; Regular, Bursty, Very Bursty. 
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TABLE III 

Optimal Time and Power Allocation Policies vs. Number of users: Found by BCD algorithm and modified 

ACCORDING to LEMMa[T] 



No. of 

Users 


T.A./ 
P.A. 


Users/ 
Slots 


Slot 1 


Slot 2 


Slots 


Slot 4 


Slots 


Slots 


Slot? 


Slot 8 


Slot 9 


Slot 10 


2 


T.A. 


1 

2. 


1 n 
iU 

U 


1 n 
U 


1 n 
iU 

U 


1 n 
iU 

U 


1 n 
U 


O.DDDD 


U 

1 n 
iU 


U 
lU 


U 

1 n 
lU 


U 

1 n 
iU 




P.A. 




2 


2.5750 


2.5750 


2.5750 


2.5750 


4.2117 


4.4720 


4.4720 


4.4720 


4.4720 






1 


10 


10 


10 


7.1883 











0.0128 










T.A. 


2 











2.8117 


10 


10 


10 


9.9872 








3 




i 


U 


U 


U 


U 


U 


U 


U 


U 


iU 


iU 




P.A. 




2 


2.4151 


2.4151 


2.5531 


2.9166 


3.9160 


4 


4.7272 


4.7283 


4.7283 






1 


10 


10 


8.5777 

























T.A. 


2 








1.4223 


10 


10 


4.7598 














4 


3 

















5.2401 


10 


8.3789 












4 























1.6211 


10 


10 




P.A. 




2 


2.3132 


2.3810 


2.8028 


2.8028 


3.6343 


4.0501 


4.2070 


5.0742 


5.0742 






1 


10 


10 


3.3321 



























2 








6.6678 


10 


5.1071 



















T.A. 


3 














4.8928 


10 


5.2864 











5 




4 




















4.7135 


10 


3.4256 









5 


























6.5743 


10 




P.A. 




2 


2.1351 


2.4281 


2.5792 


3.1523 


3.1574 


3.8909 


4.3727 


5.1247 


5.5592 






1 


10 


10 


0.5333 



























2 








9.4666 


10 






















T.A. 


3 














10 


7.3867 














6 


4 

















2.6132 


10 


3.3768 












5 























6.6231 


7.7727 









6 


























2.2272 


10 




P.A. 




1.8639 


1.8639 


2.2338 


2.2553 


3.0596 


3.2585 


3.8704 


4.0829 


5.3135 


6.5977 






1 


10 


8.9882 






























2 





1.0117 


10 


6.4262 
























3 











3.5737 


10 


2.4354 
















T.A. 


4 

















7.5645 


6.7008 











7 




5 




















3.2991 


10 












6 


























10 


0.6484 






7 





























9.3515 




P.A. 




1.6343 


1.6707 


2.0084 


2.3109 


2.6543 


2.9765 


4 


4.0098 


5.7614 


7.3733 






1 


10 


7.4864 






























2 





2.5135 


10 


3.7891 
























3 











6.2108 


8.9349 



















T.A. 


4 














1.0650 


10 


2.0483 











8 


5 




















7.9516 


3.4281 












6 























6.5718 


3.7485 









7 


























6.2515 


2.4839 






8 





























7.5160 




P.A. 




1.4847 


1.5678 


1.8273 


2.1002 


2.3413 


2.9323 


4 


4.4137 


5.7667 


7.9655 



DRAFT 



24 




Fig. 3. Throughput Improvement (BCD, PTF, ProNTO) vs. Mean Path Loss for N = 2: The effect of mean path loss on 
throughput improvement for the three energy harvesting cases; Regular, Bursty, Very Bursty. 



TABLE IV 

Fairness Index (SG+TDMA, PTF, ProNTO, BCD) vs. No. of Users: The fairness of PTF and ProNTO 

HEURISTICS ARE COMPARED TO THAT OF SG+TDMA'S AND BCD'S, THROUGH FI. 
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Fairness Index (FI) 
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Fig. 4. Average Utility Improvement (PTF, ProNTO, BCD) vs. No. of Users: The average is taken over Regular, Bursty, Very 
Bursty cases. The average utility improvements of the proposed algorithms over SG+TDMA, for increasing number of users, 
are compared. Utility improvment increases with increasing number of users. 
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Fig. 5. Maintaining Energy Causality After Energy Deferral 
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